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Abstract 

Female European corn borer, Ostrinia nubilalis, produce and males respond to 
sex pheromone blends with either E- or Z-All-tetradecenyl acetate as the major 
component. E- and Z-race populations are sympatric in the Eastern United 
States, Southeastern Canada, and the Mediterranean region of Europe. The 
E- and Z-pheromone races of O. nubilalis are models for incipient species for- 
mation, but hybridization frequencies within natural populations remain 
obscure due to lack of a high-throughput phenotyping method. Lassance et al. 
previously identified a pheromone gland-expressed fatty-acyl reductase gene 
ipgfar) that controls the ratio of All-tetradecenyl acetate stereoisomers. We 
identified three single nucleotide polymorphism (SNP) markers within pgfar 
that are differentially fixed between E- and Z-race females, and that are >98.2% 
correlated with female pheromone ratios measured by gas chromatography. 
Genotypic data from locations in the United States demonstrated that pgfar-z 
alleles were fixed within historically allopatric Z-pheromone race populations in 
the Midwest, and that hybrid frequency ranged from 0.00 to 0.42 within 11 
sympatric sites where the two races co-occur in the Eastern United States 
(mean hybridization frequency or heterozygosity (Ho) = 0.226 ± 0.279). Esti- 
mates of hybridization between the E- and Z-races are important for under- 
standing the dynamics involved in maintaining race integrity, and are 
consistent with previous estimates of low levels of genetic divergence between 
E- and Z-races and the presence of weak prezygotic mating barriers. 



Introduction 

Biological traits that influence reproductive biology, 
chemical communication, and mate attraction can influ- 
ence gene flow within a species, maintain phenotypic 
diversity among ecotypes, and may ultimately contribute 
to sympatric speciation. Key mutations that establish 
prezygotic mating barriers effectively decrease gene flow 
within previously panmictic populations and allow the 
subsequent accumulation of divergent life history traits 
(Roelofs and Rooney 2003). During the early stages of 
diversification, incipient species often maintain high levels 
of gene flow, such that introgression occurs in regions of 
the genome not linked to genes directly involved in speci- 
ation (Wu 2001; Lassance et al. 2010). Incipient species 



can retain genomic regions with shared ancestral poly- 
morphisms, as well as regions with derived mutations at a 
few key loci that are fixed differentially among divergent 
lineages (Gentile et al. 2002; Machado et al. 2002; Roelofs 
and Rooney 2003). Incomplete barriers to gene flow and 
consequent introgression can lead to homogenization of 
genomic regions not linked to genes involved in specia- 
tion. Because genetic drift and selection can lead to diver- 
gent molecular signals and phenotypic traits unrelated to 
the mechanisms involved in speciation, analyses of incipi- 
ent species provide not only opportunities, but unique 
challenges for the study of contemporary speciation. 

The European corn borer, Ostrinia nubilalis, is a 
polyphagous lepidopteran insect native to Eastern Europe 
and Western Asia, but was inadvertently introduced to 
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North America in the early 1900s (Vinal 1917). Pheno- 
typic variation includes females that produce, and males 
that respond to, pheromone blends of 99:1 or 3:97 E-: Z- 
All-tetradecenyl acetate (£11- and Zll-14:OAc) (Klun 
and Brindley 1970; Kochansky et al. 1975; Roelofs et al. 
1972). The geographic distribution of O. nubilalis that 
dominantly use the Zll-14:OAc isomer for sexual com- 
munication (Z-race) extend across the eastern two thirds 
of the United States, southeastern Canada, and Europe 
(Klun and Cooperators 1975). In contrast, populations of 
the ii-race are restricted to the Eastern United States 
(O'Rourke et al. 2010), southeastern Canada (J. Smith, 
unpubl. data), and the Mediterranean region of Europe 
(Anglade and Stockel 1984). Limited gene flow between 
O. nubilalis pheromone races has been suggested from 
molecular analyses (Harrison and Vawter 1977; Carde 
et al. 1978; Cianchi et al. 1980; Glover et al. 1991; Dop- 
man et al. 2005), laboratory choice tests (Liebherr and 
Roelofs 1975), and field collections of Fi hybrid females 
that produce a 65:35 £11- to Zll-14:OAc ratio (Klun and 
Maini 1979; Roelofs et al. 1985; Durant et al. 1995). 

The female O. nubilalis pheromone gland is comprised 
of a single layer of epidermal cells located in the 8th and 
fused 9th/ 10th abdominal segments (Ma and Roelofs 
2002). Both races produce an approximately equal 
proportion of E- and Z-ll-tetradecenyl, the precursors of 
£11- and Zll-14:OAc (Roelofs et al. 1987, 2002). The 
specific £11 to Zll-14:OAc isomer ratios produced by 
female £- and Z-race O. nubilalis is controlled by a single 
codominant genetic locus (Klun and Maini 1979; Roelofs 
et al. 1987; Dopman et al. 2004), but are modified to a 
lesser degree by unlinked genetic loci that were revealed 
by analyses of backcross progeny (Lofstedt et al. 1989; 
Zhu et al. 1996). Subsequent genetic linkage analysis indi- 
cated that the major locus controlling production of the 
£- or Z-isomer mapped to the pheromone gland fatty- 
acyl reductase gene, pgfar (Lassance et al. 2010), and that 
two alleles, pgfar-e and pgfar-z, are differentially expressed 
in the pheromone gland of £- and Z-race females, respec- 
tively. Any barrier to gene flow between O. nubilalis 
E- and Z-race field populations may be reinforced by a 
reduced ability of Fi males to locate females of either race 
(Glover et al. 1991), which suggests a coevolution 
between male and female reproductive traits (Lassance 
2010). Despite the known association between pgfar and 
pheromone isomer production, questions remain regard- 
ing this relationship in natural populations and the level 
of intrarace variation at the pgfar locus (Lassance 2010). 

The degree of gene flow in a hybrid zone can be inter- 
preted as a measure of species divergence, and can be 
important for understanding speciation mechanisms. 
Female O. nubilalis pheromones (phenotypes) can be 
determined via separation of pheromone gland-produced 
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hydrocarbons by gas chromatography (GC) analysis and 
comparison to synthetic standards. GC analyses are often 
difficult to apply within large population studies because 
the pheromone gland must be dissected from virgin 
females during scotophase when the pheromone titer is 
highest (Smith et al 1991). Male O. nubilalis can be 
collected in traps baited with either the synthetic £- or Z- 
pheromone blend, but the lures show reduced fidelity to 
race when imprecise formulations are used (Bartels et al. 
1997; Mason et al. 1997; Pelozuelo and Frerot 2008). 
Furthermore, identifying hybrid males captured in traps 
baited with either kind of lure requires GC analysis of the 
pheromone produced by female offspring from controlled 
backcrosses, a slow and logisticaUy challenging undertak- 
ing. A fast and accurate molecular diagnostic assay would 
be of great value to those seeking to phenotype an indi- 
vidual's race, not only in studies of race interactions and 
genetic isolation, but in any ecological or behavioral study 
of this species conducted in areas of sympatry. 

A single nucleotide polymorphism (SNP) is a single 
base substitution at a genomic locus, and SNPs are useful 
tools for population and genetic mapping studies (Glau- 
bitz et al. 2003). SNPs can be detected by automated 
genotyping assays (Tsuchihashi and Dracopoli 2002) or 
low- throughput methods (Vignal et al. 2003). Molecular 
genetic markers within or linked to genes affected by a 
recent selective sweep can be associated with divergent 
traits, and thus used to predict individual phenotypes in 
natural environments (Schulze and McMahon 2004). In 
this study, we identified SNP markers in O. nubilalis asso- 
ciated with pgfar-e and pgfar-z alleles previously provided 
by work by Lassance et al. (2010), and verified their pher- 
omone race specificity by direct comparison to phero- 
mone phenotypes determined by GC analyses. We then 
developed and applied a SNP assay to estimate rates of 
hybridization between £- and Z-pheromone races at 
several locations in the Eastern United States. 

Material and Methods 

Development of pheromone race-specific 
molecular genetic markers 

GenBank nr database accessions GU808256 to GU808276 
originally submitted by Lassance et al. (2010) which con- 
tain cDNA sequences derived from the O. nubilalis pgfar 
gene were downloaded in FASTA format, and aligned 
using the MEGA 5.0 DNA sequence alignment utility 
(Tamura et al. 2007) using the default parameters of the 
ClustalW algorithm (gap opening penalty 15, gap exten- 
sion penalty 6.66, weight matrix lUB, and transition 
weight of 0.5). Nucleotide diversity {d) was estimated 
using MEGA 5.0 (Tamura et al. 2007), and 100-bp sHding 
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windows were iterated across the aligned cDNA sequences 
in 2 5 -bp increments using Python scripts of DNAux 3.0 
(http://www.portugene.com/software.html), and Tajima's 
D was estimated from each window using MEGA 5.0 
(Tamura et al. 2007). The sliding window analysis closely 
approximated that previously provided by Lassance et al. 
(2010), but was replicated within this study to accurately 
determine the position of SNPs. 

Fixed sequence variation was identified between aligned 
pgfar cDNA sequences derived from E- and Z-pheromone 
races, and oligonucleotide primer pairs were designed 
from conserved flanking regions using Primer3Plus 
(Untergasser et al. 2007). Specifically, oligonucleotide 




Figure 1. Adult Ostrinia nubilalis moths (female [left] and male 
[rigint]) from tine United States population. Photograph by Marlin E. 
Rice. 



primer pairs were designed to PCR-amplify products con- 
taining pgfar-e and pgfar-z specific SNPs. Adult O. nubil- 
alis (Fig. 1.) were collected by light trap at Crawfordsville, 
lA (n = 24), located in the Midwest region of the United 
States, where only Z-race populations are present 
(Table 1; Fig. 2). Samples from a laboratory colony of 
bivoltine i:-race O. nubilalis, BENY, were obtained from 
C. Linn (Cornell University, Ithaca, NY). All DNA was 
extracted from the thorax of individual O. nubilalis adults 
as described by Coates and Hellmich (2003), quantified 
on a NanoDrop 2000 (Thermo Scientific, Wilmington, 
DE), and diluted to 10 ng//iL with nuclease free water. 

SNPs within the pgfar gene were detected by three 
independent polymerase chain reaction-restriction frag- 
ment length polymorphism (PCR-RFLP) assays. DNA 
samples were amplified in 10-/iL PCR solutions that 
included 1 x Thermal polymerase buffer (Promega, Madi- 
son, WI), 2.5 mmol/L MgCh, 50 //mol/L dNTPs, 20 ng 
DNA, 1.5 pmol of each primer (Table 2), and 0.3125 U 
GoTaq DNA polymerase (Promega). PCR was conducted 
in a Tetrad2 thermocycler (BioRad, Hercules, CA) pro- 
grammed for 96°C for 3 min, followed by seven initial 
touchdown (TD) cycles consisting of 96°C for 20 sec, 
65°C for 30 sec (-2°C/cycle each subsequent cycle), and 
72°C for 30 sec. Final PCR amplification took place with 
32 cycles of 96°C for 20 sec, 52°C for 30 sec, and 72°C 
for 30 sec. PCR product in the entire reaction volume 



Table 1. Estimates of hybridization of Ostrinia nubilalis E- and Z-race within field populations in the United States. 



Ho among SNP markers 
n 

Site ID Collection site State M/F 7aql Wdell Mse\ He (all SNPs) P-value (all SNPs) 



1 


Mead 


NE 


24/24 


0.000 


0.000 


0.000 


Monomorphic 




2 


Brookings 


SD 


12/12 


0.000 


0.000 


0.000 


Monomorphic 




3 


Kanawha 


lA 


24/24 


0.000 


0.000 


0.000 


Monomorphic 




4 


Crawfordsville 


lA 


24/24 


0.000 


0.000 


0.000 


Monomorphic 




5 


Lexington 


KY 


10/14 


0.000 


0.000 


0.000 


Monomorphic 




6 


Snyder Co. 


PA 


13/11 


0.083 


0.083 


0.083 


0.156 


0.126 


7 


Beltsville 


MD 


17/14 


0.419 


0.419 


0.419 


0.337 


0.295 


8 


Newark 


DE 


44/44 


0.193 


0.193 


0.193 


0.347 


<0.001 


9 


Cohansey 


NJ 


24/24 


0.125 


0.125 


0.125 


0.281 


<0.001 


10 


Sergentsville 


NJ 


11/13 


0.208 


0.208 


0.208 


0.311 


0.152 


11 


Little York 


NJ 


24/24 


0.125 


0.125 


0.125 


0.118 


1.000 


12 


Yates 


NY 


3/7 


0.200 


0.200 


0.200 


0.506 


0.082 


13 


Aurora 


NY 


10/7 


0.059 


0.059 


0.059 


0.059 


1.000 


14 


Spencer 


NY 


9/11 


0.100 


0.100 


0.100 


0.097 


1.000 


15 


Freeville 


NY 


5/6 


0.182 


0.182 


0.182 


0.173 


1.000 


16 


Hartford 


NY 


6/4 


0.000 


0.000 


0.000 


Monomorphic 




C 


BENY colony^ 


NY 


44 


0.000 


0.000 


0.000 


Monomorphic 





Observed and expected heterozygosity at diagnostic pgfar SNP loci assayed by PCR-RFLP from Ostrinia nubilalis adults collected by light traps at 

indicated locations. Site IDs correspond to those on the map in Figure 1. BENY colony adults are included as a control for monomorphic f-race. 

NA = not applicable. 

'Significant at 0.05 level. 

^Not included in pairwise F^-j analyses. 
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Figure 2. Map of Ostrinia nubilalis sample collection sites from 
infested regions of North America. Approximate geographic region of 
sympatric f- and Z-race populations that use a pheromone blend of 
predominantly f- or Z-A1 1 -tetradecenyl acetate, respectively, are 
indicated by the shaded area along the East Coast of the United 
States. Locations 1 to 5 are in geographic regions where historically 
only the Z-race has been found, whereas locations 6 to 19 are in 
regions of sympatry. 

was digested with a restriction endonuclease (RE) by add- 
ing 2.0 fiL 10 X buffer and 0.5 U of Msel (New England 
BioLabs, Ipswich, MA), or Ndell or Taql (Promega) in a 
20-/iL total reaction volume (Table 2), then incubated at 
37°C (or 65°C for Taql) for 14 h. Entire RE digest 
reaction volumes were loaded into 10-cm 3% agarose gels 
and separated at 80 V for 2 h. Resulting fragment sizes 
were estimated by comparison to a 50-bp ladder (Pro- 
mega), and compared to those predicted for pgfar-e and 
pgfar-z alleles (Table 2). Differences between the number 
of observed and expected genotypes at CrawfordsviUe, lA 
(historically known to be Z-race) and BENY colony indi- 
viduals (pure ii-race colony provided by Dr. C. Linn) 
were tested for significance with Chi-square {y^) tests. 
Expected heterozygote frequencies for the CrawfordsviUe, 
lA and BENY samples were calculated using Arlequin 
3.5.1.2 (Excoffier and Lischer 2010), and were performed 
as an initial test of allelic variation among groups of indi- 
viduals with an a priori known phenotype [E- or Z-race). 



Correlation between genotype and 
pheromone production 

To test the fidelity of the pgfar SNP markers for discrimi- 
nating O. nubilalis females producing E-, Z-, and hybrid 
pheromone blends, both SNP genotype and GC-analyzed 
pheromone component data were collected from individ- 
ual F4 females derived from six independent intercrossed 
families. Specifically, Families 1 to 6 were each initiated 
by pooling -200 adults from a laboratory colony of pure 
bivoltine Z-race O. nubilalis maintained at the USDA- 
ARS, Corn Insects and Crop Genetics Research Unit, 
Ames, lA, and -200 adults from the BENY colony (pure 
bivoltine i:-race). The BENY and USDA colonies served 
as controls. All families were maintained as random mat- 
ing populations of >1000 individuals until the F4 genera- 
tion. Individual female F4 pupae were placed in 28-mL 
plastic cups along with a piece of moistened cotton dental 
wick and allowed to emerge as adults in growth chambers 
at 25°C and 16:8 (L:D). Pheromone glands were removed 
and volatile compounds extracted using methods similar 
to Durant et al. (1995). In brief, pheromone ring glands 
were excised with micro-scissors at the terminal segment 
just anterior of the ring gland, during the 6th hour of 
scotophase (time of peak pheromone recovery, C. Mason, 
pers. comm.) from adult females the second day after 
eclosion (24-48 h old). Each gland was placed into a 
50-/(L point-tipped auto-sampler vial containing 5 fiL of 
heptane and an internal standard of 4.5 ng cis-7-tetrade- 
cenyl acetate (7-TDA). 0053amples were held for >30 min 
at room temperature or stored in a — 20°C freezer before 
analysis. DNA was extracted from the thorax of the same 
females, quantified, and diluted as described above. DNA 
samples were processed for genotyping at the pgfar PCR- 
RFLP marker loci using the restriction endonucleases 
Taql, Ndell, and Msel as described above. 

A 10-/.iL Varian 8200 auto-sampler syringe using a sand- 
wich technique with a 0.5 /iL upper air gap was used to 
sample 3 /iL of each pheromone extract and inject at a rate 



Table 2. Oligonucleotide primer pairs that anneal within the pheromone gland fatty-acyl reductase, pgfar, gene of Ostrinia nubilalis to prime the 
PCR amplification of regions that contain pheromone race-specific single nucleotide polymorphisms (SNPs). 



Name 


Oligonucleotide primer 


SNP 


RE 


Allele specific 


pgFAR-tf 


B'-nC GAT TCG GGA ACC CAT A-3' 


j857 


7"aql(+) 


pgfar-z 


pgFAR-tr 


B'-AGG nC GCA ACG TGG TCT AC-3' 


q857 


7"aqi(-) 


pgfar-e 


pgFAR-mnf 


B'-G GGC AAC AAA GGA GTC AAG GT-3' 


j995 


Wdell(+) 


pgfar-e 


pgFAR-mnr 


B'-CC AAA ATA JTJ CCT GTA TTT TAW GCA-3' 


q995 


Wdell(-) 


pgfar-z 


pgFAR-mnf 


B'-G GGC AAC AAA GGA GTC AAG GT-3' 


j1005 


M5el(+) 


pgfar-e 


pgFAR-mnr 


B'-CC AAA ATA TTT CCT GTA TTT TAW GCA-3' 


Q1005 


Mse\{-) 


pgfar-z 



The SNPs are indicated as the nucleotide position within the pgfar cDNA and alternate nucleotides present at the locus (nucleotide p°^'"°"), and 
the restriction enzyme (RE) used to detect each allele are indicated. G/T^^^ and G/J^°°^ are detected by PCR-RFLP of the same PCR-amplified frag- 
ment. 

7aql(-) 1B0 bp, Taq\{+) 118 + 32 bp; Wdell(-) 14S bp, Wdell(+) 83 + 62 bp; Msel(-) 14B bp, Msel(+) 93 + B2 bp. 
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of 1.5 /iL/sec into a Varian 3500 Gas Chromatograph (Agi- 
lent Teclinologies, Santa Clara, CA). The GC was equipped 
with a heated injector fitted with a 4 mm ID open-top glass 
uniliner (Restek Corp., Bellefonte, PA) containing glass 
wool, a fused sUica capillary column 15 m x 0.25 mm with 
0.25 jim Stabilwax® (Restek Corp.) film thickness, a 5 m 
X 0.25 mm fused silica guard column, and a flame ioniza- 
tion detector. The gas chromatograph was programmed for 
a 20 min run time as follows: injector temperature 200°C, 
splitless for 1.5 min, then split for the remainder of the run 
(split ratio 50:1 at 60°C); detector temperature 250°C, 
attenuation set at 32^'^; column oven programmed at 
80°C held for 2 min, temperature ramp from 60 to 240°C 
at 10°/min, held at 240°C for 5 min to end of the run. 
Hydrogen was used as the carrier gas at a flow rate of 
20 cm/sec (6.5 psi head pressure) and nitrogen was used as 
makeup gas. Under these conditions, the 7-TDA internal 
standard and the two pheromone isomers (E- and Z-11- 
tetradecenyl acetate) eluted at w 13.1-13.5 min with each 
of the three peaks separated by 0.2 to 0.4 min. Chromato- 
grams were used to estimate the ratio of E- to Z-ll-14:OAc 
isomers by comparing the area under the isomers' peaks at 
the appropriate retention times. Samples containing insuf- 
ficient quantity of pheromone for detection, as indicated 
by the lack of peaks at the appropriate retention times, 
were not classified and were indicated as unresolved (U). 

Each F4 family (1-6) was treated as an independent 
replicate, and a Pearson product-moment correlation 
coefficient (PMCC) was estimated between phenotype 
and genotype within and across families using SAS 9.2 
(SAS Institute, Gary, NC). Female phenotype determined 
from pheromone gland component separation on the GC 
(>90% £ll-14:OAc isomer = 1.0, <5% £ll-14:OAc iso- 
mer = 3.0, and all remaining estimated ratios = 2.0) was 
paired with corresponding pgfar genotypes from the same 
female {pgfar-e/pgfar-e = 0.0, pgfar-e/pgfar-z = 0.5, and 
pgfar-z/pgfar-z = 1.0). Strength of the linear relationship 
between the dependent {pgfar genotype) and independent 
variables (female GC isomer designation) were evaluated 
within each colony and across all intercrossed lines and 
pure E- and Z-race laboratory control colonies. 

Estimates of field hybridization among 
pheromone races 

Adult O. nubilalis were collected using light traps at 16 
locations from the Midwest and East Coast of the United 
States (Table 1; Fig. 2), where sites 1 to 5 were from geo- 
graphic regions historically known to have pure Z-race 
populations. In contrast, collection sites 6 to 16 within 
the East Coast region of the United States were antici- 
pated to have populations comprised of both E- and 
Z-race moths. The sex of each moth was determined visu- 
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ally, and DNA was extracted as described by Coates and 
Hellmich (2003). Individuals were genotyped at the three 
pgfar SNP markers assayed with Taql, Ndell, and Msel as 
described above. Observed heterozygosity (Hq) and 
expected heterozygosity (H^) and significance of the devi- 
ations from Hardy-Weinberg equilibrium (HWE) were 
tested for each pgfar SNP marker within all populations 
with Markov chain exact tests using the Arlequin 3.5.1.2 
(Excoffier and Lischer 2010). Differences between male 
and female Hq estimates were evaluated by Chi-square 
iX^) tests using SAS 9.2 (SAS Institute). Hierarchical 
_F-statistics and analysis of molecular variance (AMOVA) 
were also generated using Arlequin 3.5.1.2 (Excoffier and 
Lischer 2010) after subdividing populations into Midwest 
(sites 1 to 5) and East Coast (sites 7 to 16) groupings. 
Significance was evaluated based on 1000 permutations as 
described by Weir and Cockerham (1984). Pairwise Fst 
estimates between sample sites were calculated by Arle- 
quin 3.5.1.2 (Excoffier and Lischer 2010), and significance 
thresholds set at a modified (x according to the B-Y 
method (Benjamini and Yekutieli 2001). 

Results 

Development of pheromone race-specific 
molecular genetic markers 

Alignment of cDNA sequences from the O. nubilalis pgfar 
gene produced a 1607-bp consensus sequence predicted 
to contain 1317-bp (439 aa) and 1326-bp (442 aa) coding 
regions for sequences derived from E- and Z-races, 
respectively (Fig. SI). Nucleotide diversity (d) (mean 
number of base substitutions per site) across all pgfar 
cDNAs was estimated as 0.013 ± 0.005. When parti- 
tioned within and between cDNAs derived from Z- and 
E- pheromone races, d was 0.002 ± 0.001 and 
0.011 ± 0.005, respectively. The nucleotide diversity 
index (n) and Tajima's D estimate across the entire pgfar 
coding sequence was 0.020 and 1.614, respectively. Esti- 
mates of Tajima's D within 100-bp sliding windows 
across the pgfar coding sequence ranged from 1.514 to 
2.778, and revealed four regions of significant sequence 
divergence between Z- and _E-race O. nubilalis (signifi- 
cance threshold set at 2.0 as described for the same anal- 
ysis performed by Lassance et al. 2010; Fig. 3). Three 
putative pgfar allele-specific polymorphisms (SNPs) were 
identified in these regions of significant Tajima's D esti- 
mates that were also within restriction endonuclease (RE) 
sites (Fig. 3). Nucleotide positions G*^", T''''^ and T"""^ 
were fixed among cDNA sequences from £-race (pgfar-e 
alleles), and T**'^ C''''', and G^°°^ nucleotides were fixed 
among cDNAs derived from Z-race {pgfar-z alleles). Nu- 
cleotides T**", T*'^^, and T^°°^ complete palindromes for 
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Figure 3. Estimates of directional selection using Tajima's D by analysis of 100-bp sliding windows with 25-bp iterations across the Ostrinia 
nubilalis pheromone gland fatty-acyl reductase (pgfar) gene. This figure was adapted from a similar analysis by Lassance et al. (2010) to show the 
location single nucleotide polymorphisms (SNPs) detected by markers at TaqI, Wdell, and Msel restriction endonuclease sites. 



Taq\ (TCGA), NdeW (GATC), and Msel recognition sites 
(TTAA). 

Oligonucleotide primers flanking the putative polymor- 
phic p^ar RE sites were designed, and PCR-amplified 
fragments ranged from 145 to 150 bp. Restriction digest 
of the corresponding PCR-amplified products by Taq{, 
Ndell, and Msel generated 118 -I- 32-bp, 83 + 62-bp, and 
93 + 52-bp PCR-RFLP fragments, respectively (Fig. S2). 
The fragment sizes estimated from electrophoretic RFLP 
patterns were approximately the same as predicted from 
DNA sequence data for each allele (Table 2 footnotes). A 
pure 2-race population from Crawfordsville, I A (n = 48) 
and samples from the pure £-race BENY colony (n = 44) 
were genotyped at the pgfar loci to provide preliminary 
validation of the PCR-RFLP assay. The results for the 
CrawfordsvOle, lA Z-race samples (Table 1) indicated fix- 
ation of the nucleotides T**'^ G'^^ and G'""^ predicted 
for pgfar-z alleles (Fig. SI). In contrast, PCR-RFLP results 
from the BENY colony (Table 1) were fixed for G'*^^ 
T'"'', and t'""^ as predicted for pgfar-e alleles. Thus, the 
Crawfordsville, lA and BENY samples were fixed for alter- 
nate pgfar-z and pgfar-e alleles (Hq = 0), and thus the 
genotypes were able to predict the phenotypes known to 
be fixed within both of these DNA sources (the pure Z- 
race population from Crawfordsville, lA was fixed for the 
pgfar-z allele, and the pure ii-race BENY colony was fixed 
for the pgfar-e allele). 

Correlation between genotype and 
pheromone production phenotype 

The correspondence between PCR-RFLP defined pgfar-z 
and pgfar-e alleles and female pheromone gland-pro- 
duced £11- and Zll-14:OAc ratios were determined for 
segregating F4 females from replicate lines generated by 
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Figure 4. Histogram of f-AI 1-tetradecenyl acetate titers from GC 
analysis of F4 female Ostrinia nubilalis pheromone gland extracts 
presented as bulk analysis across six independent intercrossed families 
initiated between and f- and Z- race parents. 

mass mating of E- and Z-races in the parental genera- 
tion. Pheromone glands were dissected from a total of 
269 female O. nubilalis from Families 1 to 6, of which 
247 provided GC readings from Families 1 (n = 45), 2 
(n = 46), 3 (n = 46), 4 (« = 30), 5 (« = 26), and 6 
(« = 21), as well as control colonies for £-race (« = 17) 
and Z-race (n = 16). Pheromone gland-derived hydro- 
carbons from 251 females were detected by GC (Table 
SI), where the proportion of the _Ell-14:OAc isomer in 
extracts ranged from 2 to 99 across all samples and all 
replicate lines (Fig. 4). A total of 259 female samples 
were genotyped using Taql, Ndell, and Msel PCR-RFLP 
assays, of which 244 samples had both genotypic and 
GC assay results. The proportions of i;ll-14:OAc among 
homozygous pgfar-e, heterozygous, and observed homo- 
zygous pgfar-z genotypes were 98.02 ± 4.24 (« = 42), 
67.58 ± 8.92 [n = 59), and 2.74 ± 6.24 (m = 143), 
respectively. Among the seven female genotypes that 
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showed potential discrepancies between SNP marker and 
GC data, five had Ell-MiOAc ratios that ranged from 
57 to 68 but were genotyped as homozygous pgfar-z at 
the Tflql-assayed SNP marker (T'^^^/T"^^). The same five 
samples were genotyped as heterozygotes (pgfar-el pgfar-z) 
for both Ndell- (G"''''/!''''") and Msel-assayed (G'"""/ 
jioos-j gj^p jjjj^j^g^g cp^bk SI). 

The Pearson product-moment correlation coefficient 
between the proportion of £11 and Zll-14:OAc phero- 
mones produced by female O. nubilalis pheromone glands 
and the number of pgfar-e alleles from Taql, Ndell, and 
Msel SNP marker assays showed a positive relationship. 
Specifically, the correlation coefficient (r) ranged from 
0.8789 to 0.9942 across all markers (Taql, Ndell, and 
Msel) and each replicated line (Family 1 to 6), and all 
were significant (P-values < 0.001; Fig. 5.). Mean correla- 
tion coefficients for data pooled across all lines were high 
for Taql (r = 0.9699 ± 0.0321), Ndell (r = 0.9884 ± 
0.0182), and Msel-assayed SNP markers (r = 0.9884 ± 
0.0182) (P-values < 0.001). 

Estimates of field hybridization among 
pheromone races 

A total of 499 light trap-collected adult O. nubilalis were 
genotyped using the three pgfar SNP (PCR-RFLP) mark- 
ers (Table 1; Fig. 2). The estimated mean pgfar-e allele 
frequencies at SNP loci G''^^ T'^''^ and T'""^ were 
0.204 ± 0.100 (mean Hq of 0.107 ± 0.105; range 0.000- 
0.419) across all populations and across both sexes. 
Sample sites from historically Z-race regions of the 



Midwest United States (Lexington, KY; CrawfordsviUe, 
lA; Kanawha, lA; Mead, NE; and Brookings, SD) were 
fixed for SNPs T'*^^ G'"'^ and G'°°^ and were fixed for 
the pgfar-z allele (Hq = 0) as anticipated from historical 
data. In contrast, light trap samples from 11 East Coast 
locations (sites 6 through 16; Fig. 2) showed an estimated 
mean Hq across sites of 0.154 ± O.IIO, and confirmed 
the prediction that heterogeneous E- and Z-race popula- 
tions were present within this geographic region. SNP 
genotypes from 9 of 11 East coast locations did not sig- 
nificantly deviate from HWE across all SNP marker loci 
{P > 0.082), with significant deviations shown among 
genotypes from Cohansey, NJ (P-value < 0.001), and 
Newark, DE (P- value < 0.001; remaining data not 
shown). When partitioned into male and female samples, 
the mean Hq did not deviate significantly between sexes 
at any location (P-values < 0.05; remaining data not 
shown). Global estimates of P-statistics indicated signifi- 
cant pgfar allele frequency variation among geographic 
regions (Midwest [sites 1 to 5] vs. East Coast [sites 7 to 
16]; PsT = 0.0782; P < 0.001; Table S2). Pairwise Pgr esti- 
mates among light trap collection sites ranged from 
-0.02798 to 0.5156, and 17 of 55 comparisons (30.9%) 
were significant between Midwest and East Coast sample 
sites (P-values < 0.009; B-Y significance thresh- 
old = 0.014) (Table S3). In contrast, seven of 65 pairwise 
PsT estimates (10.8%) were significant among East Coast 
samples. Additionally, significant inbreeding was detected 
within Midwestern and within East Coast samples 
(Fis = 0.3133; P-value < 0.001; Table S2). The significant 
variation between population from the Midwest compared 
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with the East Coast were expected due to historical geo- 
graphic range of i:-race populations within East Coast 
regions. Thus, these results could be considered comple- 
mentary to the laboratory genotype-phenotype associa- 
tion studies performed above. 

Discussion 

Development of pheromone race-specific 
molecular genetic markers 

The two pheromone races of O. nubilalis show partial 
reproductive isolation when in sympatry, and may repre- 
sent incipient species in the early stages of divergence 
(Dopman et al. 2010; Lassance et al. 2010). E- and Z-race 
individuals are morphologically indistinguishable, but 
females from each race can be identified by analysis of 
fatty acid derivatives produced in the pheromone gland 
(Klun and Brindley 1970; Kochansky et al. 1975; Roelofs 
et al. 1972). Correspondence of phenotype with genotypes 
(mutations) has been established for insecticide resistance 
traits (Ffrench-Constant et al. 1993), but population asso- 
ciations can be complicated by effects of inbreeding, pop- 
ulation structure or selection (Berlocher and McPheron 
1996; Baxter et al. 2010). The population association of 
the three molecular genetic markers developed in the cur- 
rent study was facilitated by prior knowledge that O. 
nubilalis pheromone production traits segregating in 
backcross pedigrees are linked to the pgfar locus (Lassance 
et al. 2010). Despite this association between pgfar alleles 
and female pheromone within pedigrees, marker associa- 
tion within populations requires linkage disequilibrium 
(LD) in the face of continued genetic recombination 
(Jorde 2000). The high levels of nucleotide diversity and 
lack of shared ancestral polymorphism between cDNAs 
derived from E- and Z-race O. nubilalis, along with signif- 
icant Tajima's D estimates, suggested to Lassance et al. 
(2010) that directional selection is acting on the pgfar 
locus. 

The three SNP markers within O. nubilalis pgfar that 
were assayed by PCR-RFLP in this study are positioned 
in regions that were significantly affected by directional 
selection (based on sliding window estimates of Tajima's 
D < 2.0). Although this analysis was replicated from that 
previously performed by Lassance et al. (2010), our simi- 
lar analysis allowed positioning of the SNP markers 
within regions showing evidence of directional selection 
(Fig. 2). Furthermore, two of the three SNPs were at sec- 
ond codon positions and caused nonsynonymous (amino 
acid changing) mutations that were fixed differentially 
among E- and Z-race cDNAs. Specifically, the Taql PCR- 
RFLP assay detected a G/T"" SNP that results in a 
deduced Cys to Phe change, and the Ndell PCR-RFLP 



assayed for a T/G SNP that predicts an He to Ser 
change. The T/G^°°^ SNP detected by the Msel PCR-RFLP 
assay is at a 3rd codon position, but is associated with a 
nonsynonymous Val to Met change that is due to 
mutation of the 1st codon position (Val [GTT] to Met 
[ATG]). Initial population screening of known E- and Z- 
race samples, followed by more broad population sam- 
pling and association studies, indicated that the three 
SNP markers are diagnostic for the differentiation of O. 
nubilalis pheromone races collected in the field. Focusing 
on amino acid changing SNPs in population association 
studies has been proposed, because significant associations 
may have a higher probability of being detected at non- 
synonymous sites (Botstein and Risch 2003). The accu- 
mulation of mostly nonsynonymous changes within O. 
nubilalis pgfar suggests that divergent selection has 
resulted in enzymes with derived specificities for the pro- 
duction of E- and Z-stereoisomers. 

Correlation between genotype and 
pheromone production 

SNP markers consist of base substitutions at a single 
genomic locus, where individual mutations are generally 
biaUelic and have lower allele diversities and provide less 
statistical power to discriminate unique genotypes com- 
pared to microsateUite loci (Xing et al. 2005). However, 
SNPs are increasingly being used for population genetics 
and mapping studies due to their abundance, and relative 
ease of discovery and development (Glaubitz et al. 2003; 
Morin et al. 2004). SNP markers were previously devel- 
oped from O. nubilalis expressed sequence tags (ESTs; 
Coates et al. 2008, 2011a), and subsequently applied to 
detect significant levels of genetic differentiation among 
O. nubilalis populations (Coates et al. 2011a) and to 
determine linkage groups associated with Bacillus thurin- 
giensis resistance traits (Coates et al. 2011b). SNPs have 
been used in other species of Lepidoptera to identify pop- 
ulation genetic structure (Margam et al. 2011) and to 
detect genome regions that influence traits (Sreekumar 
et al. 2011). Our data show that the segregation of three 
SNP markers within the O. nubilalis pgfar gene are signifi- 
cantly correlated with the proportion of £ll-14:OAc 
produced in the female pheromone gland. These results 
are consistent with previous analyses of backcross pedigree 
data (Lassance et al. 2010). This evidence also indicates 
that linkage disequilibrium in this genome region between 
O. nubilalis pheromone races may be maintained by dis- 
ruptive selection. The size of the O. nubilalis genome 
region affected by LD remains unknown, but within other 
species where the level of LD is known haplotype blocks 
have been shown to span regions of -500 kb (Hirschhorn 
and Daly 2005). Fourteen putative loci were identified 
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and proposed to be involved in adaptive divergence 
between the species O. nubilalis and O. scapulalis (Mida- 
megbe et al. 2011), but analogous genome wide studies 
have not been reported between E- and Z-race O. nubilalis. 

Despite the strong correlation between pgfar genotype 
and female phenotype, estimated ratios of _Ell-14:OAc in 
F4 females with pgfar-e/pgfar-z genotypes ranged from 53 
to 94 (mean 67.3 ± 9.0), and often deviated from the 
expected 65:35 £11- to Zll-14:OAc hybrid ratio (Klun 
and Maini 1979; Roelofs et al. 1985). Reciprocal crosses 
between E- and Z-race O. nubilalis in previous studies 
indicated that modifier loci may affect female pheromone 
production, causing shifts in heterozygote pheromone 
blend ratios, and were most noticeable in female Fj back- 
crosses to the Z-race (Lofstedt et al. 1989; Zhu et al. 
1996). The effect of segregating modifier loci on female 
pheromone blend ratios may be responsible for the 
skewed ratios we observed among heterozygous F4 
females, but additional experiments will be required to 
determine the genetic loci involved. The rate of mating 
success for Fi hybrid females from United States popula- 
tion that express intermediate pheromone blend ratios 
showed no significant difference in backcrosses to either 
E- or Z-race males in laboratory experiments (Pelozuelo 
et al. 2007), and suggest that these backcross individuals 
may also be present in natural populations. 

Estimates of field hybridization among 
pheromone races 

Mate attraction in O. nubilalis may be a function of inde- 
pendent genetic loci; the Pher {=pgfar) locus that deter- 
mines female pheromone production, and Resp and Olf 
loci that influence male behavioral response and structure 
of the male antennae, respectively (Hansson et al. 1987; 
Roelofs et al. 1987; Dopman et al. 2004). The accumula- 
tion of genetic and phenotypic differences between E- and 
Z-race adults may lead to preferential mating within each 
race. Regardless, hybridization occurs in both the field 
and laboratory, where pairings of £-race male with Z-race 
females tend to be more frequent than the reciprocal 
cross (Liebherr and Roelofs 1975; Linn et al. 1997; 
Pelozuelo et al. 2007). Male attraction to females of the 
opposite pheromone race decreases significantly with 
increasing distances (Dopman et al. 2009), and successful 
mating may also depend upon recognition of a male 
courtship pheromone at close range as well as courtship 
behaviors and signaling (Lassance and Lofstedt 2009; Ta- 
kanashi et al. 2010). Despite chemical and behavioral 
mechanisms of prereproductive isolation between E- and 
Z-races, accurate estimation of the strength of this repro- 
ductive barrier in natural populations has been elusive 
(see Introduction). 



Hybridization of E- and Z-race O. nubilalis 

Limited gene flow has been detected between O. nubil- 
alis pheromone races using anonymous genetic markers 
(Harrison and Vawter 1977; Carde et al. 1978; Cianchi 
et al. 1980; Glover et al. 1991). Willett and Harrison 
(1999) concluded from analysis of variation in the phero- 
mone binding protein gene that gene flow occurs between 
the E- and Z-races in New York. Hybrid female frequen- 
cies of 5%, 11%, and 12% were estimated at three loca- 
tions in North Carolina (Durant et al. 1995), and 0-22% 
was observed among sites in New York (Roelofs et al. 
1985), but estimates were based on GC analysis of a lim- 
ited number of pheromone glands. Up to this point, esti- 
mation of hybrid formation within field populations has 
remained an impasse for population genetic studies due 
to difficulties in performing GC analysis upon dissected 
pheromone glands of wild caught virgin females. The 
novelty and utility of the current study is that molecular 
genetic markers developed herein discriminate pgfar-e and 
pgfar-z alleles within field populations, and for the first 
time offer a relatively high throughput method for identi- 
fying phenotypes for population genetic studies. The 
markers were applied to detect pgfar-e/pgfar-z heterozyg- 
otes, and subsequently to estimate hybridization fre- 
quency between E- and Z-race O. nubilalis. Genotyped 
individuals {n = 331) from light trap samples collected in 
regions of the United States with sympatric E- and Z-race 
populations indicated hybridization frequencies ranged 
from 0 to 41.9%. The assortative mating model assumes 
that attraction between E- and Z-race O. nubilalis is rare 
(Roelofs et al. 1987), but our genotype data suggest that 
intermating of E- and Z-races can occur at relatively high 
frequencies in natural populations in the Eastern United 
States. Given that mate attraction among pheromone 
races is weak at long distances (Dopman et al. 2009), the 
breakdown in assortative mating in the field may take 
place at close distances. Adults tend to concentrate in 
grassy "aggregation sites" during the night for mating 
(Showers et al. 1974, 1976), which perhaps would pro- 
mote close-range encounters between E- and Z-race 
moths if temporal mating periods overlap. 

Given the observed moderate to high levels of hybrid- 
ization, assortative mating may not be the only factor 
reinforcing the genetic isolation between E- and Z-races. 
Females with 70, 72, and 73% £ll-14:OAc were pre- 
viously detected in New York, and may be evidence that 
backcross females are produced in natural populations 
(Roelofs et al. 1985), but genetic tests for backcross 
females remain undeveloped pending the identification of 
modifier loci (Lofstedt et al. 1989; Zhu et al. 1996). Fur- 
thermore, the reproductive fate of hybrid males remains 
obscure. Glover et al. (1991) showed that most heterozy- 
gous (Fi hybrid) males derived from reciprocal cross do 
not respond to pheromone blends produced by either E- 
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or Z- race females. The infrequent hybrids that did 
respond showed no pheromone blend preference. Thus, 
Fi hybrid males might be a genetic dead end that are 
incapable of mating (Lassance 2010), and might impose a 
partial barrier to the introgression of genes between pher- 
omone races. The development and application of mark- 
ers for the male Resp locus (Dopman et al. 2005) as well 
as genes involved in the modification of female phero- 
mone blend ratios among backcrosses in O. nubilalis 
(Lofstedt et al. 1989; Zhu et al. 1996) will likely be 
important in understanding direction of gene flow 
and the reproductive fate of hybrid within natural 
populations. 
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